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ABSTRACT 

In this thesis we extend several results on prediction 
intervals that were obtained under the assumption that the 
sample observations are independent and identically 
distributed as N(y,a ). We assume that the samples are 
correlated with a prescribed correlation structure and show 
that many of the results available for the independent case 
apply equally well for the correlated samples. The 
correlation structure assumed occurs in variance components 
models in Analysis of Variance and the results can also be 
applied to the case where the samples have an intraclass 
correlation (equicorrelated samples) . 
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I. INTRODUCTION 



A prediction interval is a random interval that contains 
the value of a future observation or some function of 
future observations and whose end points are functions of 
previous sample values. Such an interval provides an 
indication of the uncertainty in the future observations. 
More specifically, a lOOr percent prediction interval for 
the value of a future sample is an interval that is based 
on a previous sample and encloses the future observations 
with probability r, independent of the values of the 
distribution parameters, such as the mean or the standard 
deviation. A prediction interval needs to be distinguished 
both from a confidence interval and a tolerance interval; 
a confidence interval encloses the value of an unknown 
parameter and a tolerance interval is an interval within 
which a specified proportion of the population values will 
lie with a specified probability. 

In many practical problems, it would be of interest to 
construct a prediction interval for the values of the next 
k sample values from a population. For example, if only 
one machine is available for testing and we must perform 
trials sequentially, a prediction interval could provide 
helpful information about the total time needed to complete 
the experiment or perhaps the number of trials it would be 
possible to perform. Another application of prediction 
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intervals is in forecasting before a planned experiment is 
completed. In an experiment where each observation is 
expensive or where they can be made only infrequently, 
prediction intervals may be helpful in reaching a decision 
on the profitability of continuing the experiment at inter- 
mediate points in the experiment. For example, when the 
experiment concerns a physical input or output, preliminary 
estimates of the ultimate amount of needed input material 
or of the ultimate storage needed for the output might be 
helpful. In other situations where the random variable is 
the "time until occurrence of an event," and where physical 
limitations prevent the concurrent running of all planned 
trials, prediction intervals might provide helpful infor- 
mation concerning the total time until completion of the 
planned experiment. Prediction intervals are also of 
frequent interest to a typical consumer of one or a small 
number of units of a given product. Such an individual is 
generally more directly concerned with the future performance 
of his specific sample than in the process from which the 
sample had been selected. A prediction interval to contain 
each of the values of the sample would then provide him 
with an interval within which he may expect the performance 
of all his units to be located with a high probability. 

Based upon his experience with a previous sample of 10 light 
bulbs, a consumer might wish to construct an interval which 
would have a high probability of including the performance 
values of each of three additional bulbs. 
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In this thesis we derive prediction intervals for one 

future sample observation as well as simultaneous intervals 

for a specified number of future sample observations when 

the samples are correlated. These results are obtained as 

extensions of results due to Hahn C5l* He derived similar 

intervals for the case where the samples are independent and 

2 

identically distributed as N(y,a ). 

In Chapter III it is shown that Hahn’s prediction inter - 
val for the standard deviation of a single future sample is 
valid even in the case where the sample values are correlated 
and have a multivariate normal distribution with mean vector 
^ = (y,y,y,. . . ,y)' and covariance matrix V having the following 

structure ; 



V = | ( H + H«) + 

nxn nxn nxn 




H’ is the transpose of H , 
nxn nxn 

a are positive constants, I_ 

nxn 



( I - E ) (l.D 

nxn nxn 




h ± (i=l,2,3,... ,n) and 
is an' nxn identity matrix. 
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and 



E 

nxn 



is an nxn matrix all of whose elements are 



unity. 

Simultaneous prediction intervals for the standard 
deviations of k future samples are also derived and examples 
illustrating the results are provided. 

A covariance matrix with the above structure occurs in 
the study of random effects models in analysis of„variance. 

p 

If s ampl es are drawn from a normal distribution-N-(y , 0 — ) 
and it is assumed that y it self i s normally distributed 
as N(n,0^ ) ^hen it can be shown that the sample values 
have a multivariate normal distribution with mean vector 
n = (n,n ,n , . . . ,n) ' and covariance matrix 



It can be seen that the matrix V has the same structure 



possible application of the results of this thesis is in 
the following situation. From a lot containing a large 
number of guns n are selected at random. Each of these 
guns is then fired k times and the resulting miss distances 



V = 




as in (1.1) by letting h-^hg 




n y 
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from a target are measured. Based on the mean of the 
measured miss distances, a prediction interval for the 
miss distance for a randomly chosen gun may be predicted. 

Chapter IV deals with procedures for constructing a 
prediction interval to contain a single additional observa- 
tion and also with constructing a simultaneous prediction 
interval to contain all k additional future observations, 
when the samples are correlated and the covariance matrix 
has the structure as in equation (1.1). 
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II. SUMMARY OP KNOWN RESULTS 



A. DEFINITIONS AND NOTATIONS 

,k and j = l ,2 , 3 , . . . ,n^, be k+1 sets 
of random samples of size n^ from a normal distribution 



Let X ±J , i=0 ,1 ,2 , 3 } 



N(p,o^). The n Q samples for i=0 are considered as the 
given sample and the remaining k sets are future samples 
for which prediction intervals are needed. 

Let 

1 1 

X. = -±- Z X., 

1 n i j=l 1J 

n i 

and S. 2 ='-f- Z (X,. - X,) 2 
i j = l J 



where i=0,l,2 ,3. . . ,k and j=l,2,3, . . • ,n^. 



B. PREDICTION INTERVALS FOR THE STANDARD 
DEVIATIONS OF FUTURE SAMPLES 

It is well known that n o S Q 2 /a 2 and n^S^/o 2 (i=l,2,3. . . >k) 

have a Chi-square distribution with n -1 and n^-1 degree 

of freedom respectively and they are mutually independent. 

2 2 

Thus, S. /S. follows an F distribtuion with n.-l and 
n -1 degree of freedom respectively, i = l ,2 , 3 , . . . } k . 

Therefore, a prediction interval to contain the standard 
deviation S^ of a single future sample of n^ observations is 

- T 
L 
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Pr^FC^-l^-l; (1-^/2)** < S 1 < S o F(n 1 -l,n 0 -l;(l+r)/2) , ' 5 }= r 

( 2 . 1 ) 

«.VI 

where F(n i ~l J n o -l ; (l-r)/2) and FCn.^-1 ,n Q -l; (l+r)/2) 
are lower and upper 100 r$ points of F distribution with 
n^-1 and n -1 degree of freedom respectively. 

A two-sided 100 r$ prediction interval to contain the 
standard deviation of a single future sample of size n^ 
is 

(F (n^- 1 ,n Q -l ; ( 1 -r )/ 2 ) ^S 0 , F(n.-l,n 0 -l; ( 1 +^/ 2 )^) ( 2 . 2 ) 

To obtain a simultaneous interval to contain the 
standard deviations of k future samples assume that 
n i =m, 1=1,2 ,3, . . . ,k, and let 

S i 2 

max —^5- = W T (K,m-l,n - 1 ) 

1 S o 

S i 2 

and min — ^ = Wg (K ,m-l ,n Q -l) 

1 S o 

The random variables W T (K,m-l,n - 1 ) and W c (K,m-l,n - 1 ) are 

Li O O O 

known as the studenti zed largest and studentized smallest 
Chi-square variates, respectively, in the statistical 
literature and some tables [1] of the percentage point of 
their distributions are available. Let Dy (K,m-l,n Q -l;r) 
and D^(K,m-l,n 0 -l,l-r) denote the upper 100r$ and the lower 
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100(l-r)% points of the distribution of W T (K,m-l,n -1) 

-Lf O 

and Wg (K ,m-l ,n o ~l) , respectively. 

Then 

Pr{max S.^ < D TT (K,m-l,n -l;r)S ^} = r 
. l — U o o 



and 



Pr{min S. > D T (K,m-l,n -l,r)S^^} = 1-r 

4 1 — L O O 



( 2 . 3 ) 



A simultaneous prediction interval to contain all the 
standard deviations jSg ,S^. . . ,S^ is given by 

(D TT (K,m-l,n -l;r)^S , D T (K,m-l,n -l;l-r)^S ) 

U O O ij o o 



C. PREDICTION INTERVALS FOR THE OBSERVATIONS IN A FUTURE 
SAMPLE 

Let X^ ,X^ } • • • >X n be the values of n given samples 

2 

from a normal distribution N(y,a ) and let > x n +2 ’^+3’ 

. . . >X n+ ^ be the values of k future independent observations 
to be drawn from the same distribution. To get a prediction 
interval to contain a single additional observation X n+ ^, 
we proceed as follows; 

Let 

Z = X - X 
1 n+1 o 

the expected value of is zero and the standard deviation 

of ^ is Z, ~ 
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It is easily seen that that the standardized variable 



Z 1 - 0 



1 if 



a 2 U + i) 



X .. - X 
= n+1 o 

1 % 

aa+-) 



and 



p 1 n p 

S = - l (X.-X ) 

o n i _ 1 l o 



are independent. Therefore, 






T = 



(n-l)S 



o 



X n + 1 - X o 

S o a+ E )h 



o (n-1 ) 



follows a t distribution with n-1 degrees of freedom, 
Thus, 



' y Pr{X Q +t (n-1 ; (l-r)/2) ( 1+1/n ) iSs 0 <x n+1 < x 0 +t (n-1 ; (l+r)/2)(l+l/n)S}=r 

(2.4) 



where t (n-1; (l-r)/2 ) and t (n-1; (l+r)/2 ) are lower and upper 
lOOr# points of t-distribution with n-1 degrees of freedom. 

Hence, a two-sided lOOr# prediction interval to contain 
a single future observation 
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is 



X^ ± t (n-l|: ; (l+r)/2 ) (l+l/rO^S 
O' o 

To determine a simultaneous prediction interval to 
contain all k future observations, first, let 



z ± Z n+ ^ X o , i 1>2 ,3, . . . ,k. 

Then, the expected value of Z i is zero and the variance of 
Z ± is 



« 2 a + i) 



and it can be shown that cov(Z.,Z.) = c / n for ail i and j, 





i ^ j . The transformed variables 



aCl+i)** 

n 



*n+i - X o 



i 1 , 2 , 3 , • • • ,k 



have standard normal distributions. 

2 2 * 

Since (n-l)S^ /a is independent of the Z^ and has a 



Chi-square distribution with n-1 degrees of freedom, each 
of the ratios 



T . = 



Z . 

l 



(n-l)S 
(n-1) a 



X n + i - 

S (1+-) 15 
o n 
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follows a student's t-distribution with n-1 degrees of 
freedom and the are correlated. The random variables 
T^ ,^2 ,T^ , . . . jTp. are jointly distributed according to the 
multivariate generalization of the student's t-distribution 
with n-1 degrees of freedom. Tables of the percentage 
points of this distribution are given in [4]. If u is 
defined as the solution of the integral equation 



u 



u 



u 



r = I S'" f f n 



-u -u 



-u 



mm m dT-. , dTp , dT p J . . . 

^ * j-2 3 3 5 * * * 5 K ± 



, dT 



K 



'mm m is the joint probability density 

1* 2* 3* * * * K 



where f^ 

function of multivariate t-distribution with n-1 degrees 
of freedom, then ■ 



Pr{X -ud+i)^ <X ,, <X +u(l+i-)" 2 S 



n+1 



n 



,X -uCl+^S <X <X +u(l+^) i % } = r 
5 o n o n+k o n o 



The resulting 100r$ simultaneous prediction interval to 

contain the values X , , ,X ,~,X X .. of all k additional 

n+1* n+2’ n+3* * n+k 

observations is 

X o ± u(l+^) 5s S o (2.4) 

D. SOME THEOREMS USED IN DERIVING THE RESULTS IN THE THESIS 

* Theorem 1 . If X is distributed N(y,a^I), then X'AX/a^ 

2 2 

is distributed as x (K,X), where X = y ' Ay/2a , and k = rank 
of A, if and only if A is idempotent. 
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* Theorem 2 . If X is distributed N(£,V), then X’BX is 

2 

distributed as x (k,X), where X = and k is the rank 

of B, if and only if BV is idempotent. 

* Theorem 3 » If X is distributed N(jj_,V), then X * AX and 
X'BX are idependent if and only if AYB = 0. 



* Theorem H . If X is distributed N(£,V), then Y = C.’ X and 
X* AX are independent if and only if C’VA = 0. 

* Theorem 5 . (Hogg and Craig theorem) 

Let Q = Q]_ + Q 2 + Q 3 + • • ,+ ^k-l + ^k’ where Q ^ ,Q 2 5 Q 3 > • • • j^k-i > and 
Q k are k +1 random variables^ that are quadratic forms in the 

observa tions o f a random sample o f size n fr om a normal 
distribution N(p,cr^). Let Q/o^ be x^( r )j let Q^/o^ be 
X (r.), i=l , 2 , 3 , . . . ,k-l , and let Q. be non-negative. Then 

1 rC 

the random variables Q, ,Q ,Q_ , . . . ,Q. are mutually stochas- 

2 2 k ~ X 

tically independent and, hence, Q k /a is x (r k = r - Z r . ) . 

J I 

* Theorem 6 . (Baldessari theorem). Let X be a 

nxl 

multivariate normal distribution with mean vector jj and 

nxl 

covariance matrix V , i.e., N(jj,V), and Bq ,B^ jB^ , . . . ,B k 

nxn 

be (nxn) idempotent matrices satisfying 



' Z B, = I - i E 

4 o J n 

j = 0 0 nxn nxn 



where I is the (nxn) identity matrix and E is 
nxn nxn 

a (nxn) matrix all of whose elements are unity. Let 

a be a positive constant. Then, a necessary and sufficient 
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condition for X’BjX/a, j=l ,2 , 3 , . . . ,k , to be mutually 
independent and have non-central Chi-square distribution 
with r^ (r^ = rank of , j = 0 ,1 ,2 , . . . ,k) degree of freedom 
is that the covariance matrix V has the following structure 

V = i( H + H' ) + a( I - E ) 

nxn ^ nxn nxn nxn nxn 

where H , I and E are defined on (1.1). 

nxn nxn nxn 
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III. PREDICTION INTERVALS TO CONTAIN THE STANDARD DEVIATIONS 
OF FUTURE SAMPLES - CORRELATED CASE 



Hahn [5] derived prediction intervals to contain the 
standard deviations of future samples of independent and 
identically distributed random variables from a normal 
distribution with unknown mean and unknown standard devia- 
tion. In this chapter we extend Hahn's results to the case 
where the samples are correlated and have a special type of 
covariance structure. 

Section A deals with the procedures for constructing a 
prediction interval to contain the standard deviation of a 
single future sample of size n^ observations, based on a 
given sample of size n . 

Section B deals with the construction of simultaneous 
prediction intervals to contain the standard deviations S^ 
i = l,2,3,...,k of k future samples of sizes n^. 

Numerical examples are given in Section C. 

A. PREDICTION INTERVAL TO CONTAIN THE STANDARD DEVIATION 

OF A SINGLE FUTURE SAMPLE 

Let x oi ,X 02 > x o' > * ’ ’ ’ ,X 0n be the values of a given sample 
3 n o 

and X.^ ,X 12 ,X 13 } ’ * * ,X ln * va ^ ues a f uture sample. It 

is required to construct a prediction interval for the 
standard deviation S^ of the future sample based on the 
standard deviation S Q of the given sample. 
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Let 



n. 



X, - — E X, . 
i n ± J = 1 io 



and 



n , 



n 



Z (X. . - 



i J=1 



ij 



X ± ) 2 where i 



= 0,1 



f 



-^■ir 



Let 



and 



_ i 1 

x ' n if 0 ^ X ij where N = % + n : 



l n i 

2 1 J. -L 

s = w 1 

i=0 j=l 



Z (X ±j . - X)‘ 



denote the sample mean and variance of the combined sample 

of size N = n + n n . 

o 1 

_ _ _ p 

Since X^ = (NX - n o X o )/n^, the sum of squares NS can be 

partitioned as follows: 

p 1 n i _ o 1 n i _ _ _ p 

•NS = E Z (X..-X) = E E (X. .-X.+X.-X) 

i=0 j-1 ij i=0 J-l 1J 1 1 

i n • 

1 1 — 2 — — 2 
= E E (X. .-X, r + n.(X,-Xr 
i=0 j=l 1J 1 11 

" n o S o 2+n l S l 2+n o (X o- X)2+n l' (NX - n o X o )/n l- X)2 



p p ^ o ^ p 

=• n o S o + n l S l + lb (X " X o } 



( 3 . 1 ) 
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Expressing the sum of squares in (3*1) as quadratic forms 
we can write the equation as 



X'B X = X 'B, X + X’B„X + X'B_X 

— — o— — —1— — —2— 3 _ 



(3-2) 



where B^, B^ and are Idempotent matrices and 



B = I - N 1 E 
NxN NxN 



and 



-1 T 



B ± = I - n^ E , 1=0,1. 



n^xn^ 



n^xn^ 



If X 0 1 ,X 02 ,X 03 5 



,X 0n o 5 X 11 ,X 12 5X 13. , ' ‘ ‘ ^ln.,^ 



are 



independent and have identical normal distributions with 

2 2 2 2 2 
mean y and variance a 5 then it is known that NS /a . ri S /a 

J J o o 

2 2 

and n n S n /a have Chi-square distributions with N-l, n -1 

_L JL O 

— — 2 

and n^-1 degrees of freedom respectively and n Q N(X-X o ) /n^ 
is non-negative. Thus, Hogg and Craig’s theorem (theorem 5) 
applies to equation (3.1). Therefore, the three quadratic 
forms on the right hand side of (3-1) are mutually independent 
and n Q N(X-X Q ) /n^o has a Chi-square distribution with 1 
degree of freedom. It also follows that the matrix B^ in 



equation (3-2) is also idempotent 
Now, suppose X 
is a' vector random variable having a multivariate normal 



' X 01’ X 02 ,X 03’’ ' ' ,X 0n ’ X 11’ X 12’ X 13’ 



’ X lru > 



distribution with mean £ = (y,y,y, . . . ,y) and covariance 

Nxl 

matrix V which has the following structure. 

NxN 



V 

NxN 



■ ¥»- 



+ H ) + a( I - E ) 
NxN NxN NxN NxN 



(3-3) 
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where 



H 

NxN 



h. 



\ h 



\ 



N h N h N 



h N / 



I is an NXN identity matrix, E is an NXN matrix whose 
NXN NXN 

elements are all unity and a and tm, i = 1,2,3,...,N, are 
positive constant. 

To obtain a prediction interval for S^, we start with 
equations (3*1) and (3*2). Since the matrices B , B , B ? 

3 U L -L 

and B_ are idempotent matrices and Bq = Z B. = I - N E 

3 NxN j = l J NxN NxN 

all the conditions of the Baldessari theorem (theorem 6) 

are now satisfied. Therefore, the three quadratic forms of 
(3.2) on the right hand side have central Chi-square distri- 
butions with n Q -l, n^-1 anc ^ 1 degree of freedom respectively 
and are mutually independent. 

Thus, the random variable 



X B 2 X / X B 1 X 



(n l' 1)S l 2 / (n o- 1)S o 2 



a 



a 



a(n 1 ~l) 



a(n Q -l) 



follows an F-distribution with n^-1 and n Q -l degrees of 
freedom and we obtain a prediction interval for as; 
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WP^-l.yljd-r)^) < < F(n^-l,n o -l; (l+r)/2) } = r 

S o 

or equivalently, 

Pr{S o F(n 1 -l,n o -l;(l-r)/2) ls < S 1 < S^Cry-l^-l; {Y+v)/2) h } = r 

(3.4) 



where r is the chosen confidence coefficient and 
FCn^ljn^l; (l-r)/2) and F(n 1 -l,n 0 -l j (l+r)/2) are the 
appropriate percentage points of the F distribution with 
n^-1 and n Q -l degrees of freedom. This yields the following 
two-sided 100r$ prediction interval to contain the standard 
deviation S^ of n^ future observations; 



^ S o*-F(n o -l,n 1 -l; (l+r)/2)^» F(n l 1,n o" 1; (1+r)/2) "V ( 3 • 5 ) 



This prediction interval for is exactly the same as the 
one obtained by Hahn [5] for the independent case. 



B. A SIMULTANEOUS PREDICTION INTERVAL TO CONTAIN THE 
STANDARD DEVIATION OF EACH OF k FUTURE SAMPLES 

.As in the previous section, let ^oi ,Xq 2 j^03‘ * * 5 ^0n 
be the values of a given random sample and let 



^11 ’ ^1 2 3 ^1 3 * * ’ **^ ln - 



. ,X 



, X n -i jX-jojX-—,. • • ,x 
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Let 
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be the mean and variance of the pooled sample of N 
observations . 
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The sum of squares NS can be partitioned as 
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(3.6) 
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It follows that (N-l)S /a has a Chi-square distribution 

2 2 

with N-l degrees of freedom and (n^-l)S^ /a , i=0 ,1 ,2 , 3 , . . . ,K, 

have Chi-square distributions with n.-l degrees of freedom 

respectively, and the last term of (3.6) is non-negative. 

Applying Hogg and Craig theorem (theorem 5) we can conclude 

that the last term of (3*6) also has a Chi-square distribu- 

K 

tion with K[(N-l)- I (n.-l) = (N-l)-(N-(K+l) )=K] degrees 

i=0 

of freedom and that all the sums of squares on the right 
hand side of equation (3.6) are mutually independent. 
Expressing these sums of squares as quadratic forms we 
can write equation (3.6) as; 

X'BX^ = X'^X + X'B^X + X'E^X + ... + X’RX + X'R^X (3-7) 



where 



B= I -N _1 E and B . = I - n~\ , i=0,l,2, . . . ,K,K+1, 

NxN NxN 1 n 1 xn i 1 n i xn 1 



are idempotent matrices (see theorem 1). 

Now, suppose X is a random vector having a multivariate 
Nxl 

normal distribution with mean y = (y ,y ,y, . . . ,y) ' and 

Nxl 

covariance matrix V which has the form (3-3). 

NxN „ 

p 

The partition of NS given in equations (3.6) and (3-7) 

are valid for this case also. Thus, we know B and B^, 

i=0,l,2,3j . • • jK, are idempotent matrices and B = I_ - N 

NxN NxN 
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So, the conditions of Baldessari 



rij> Aii^ AA i 

theorem (theorem 6) are all satisfied. Therefore, X BX/a 
has a Chi-square distribution with N-l degrees of freedom, 

X' B^X/a, i=0,l,2,3,... 5 K, have Chi-square distributions 
with n i -l degrees of freedom and X’B^X/a has a Chi-square 
distribution with K degrees of freedom. Further the k+2 
sums of squares on the right hand side of (3-7) are mutually 
independent. Thus, each of the random variables 







a 



< n i- 1)S l 2 / (n Q- l)S 0 2 



a(.n^-l) , 



a(n Q -l) 




where i = l,2,3,...,k, follows an F distribution with n ± -l 

and n-l degrees of freedom. 

Now, assume n x = n 2 = n 3 = •** = n k = m ‘ Then ’ 
S. 2 /S q 2 , i=l ,2 , 3 , . . • ,K, has an F distribution with m-1 and 

n-l degrees of freedom. 

Define the random variables 



W L (K,m-l,n o -l) 
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and 



W„(K,m-l,n o -l) = min — j 3 
^ i S _ 



i=l,2 ,3, • • • *K. 
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The distributions of W T (K,m-l,n -1) and W c (K,m-l,n -1) 
are known as the studentized largest and studentized 
smallest Chi-square distributions, respectively. The upper 
percentage point D T7 (K,m-l ,n -l;r) of W (K,m-l,n -1) and 
the lower percentage point D T (K,m-1 ,n -1 ; 1-r) of W c (K,m-l,n -1) 

O o O 

were tabulated by Armitage, J.V. and Krishnaiah, P.R. and 
are available in [1]. 

Then, 

Pr{W L (K,m-l,n o -l) <_ Dy(K,m-l,n o -l;r) } 
s i 2 

= Pr{max — ~ £ D n (K,m-l,n -l;r)} = r 

i S u ° 

o 

and 



Pr{max S^ 2 £ D u (K,m-l,n o ~l;r)S o 2 } = r. 

Thus, an upper 100r/S simultaneous prediction limit to 
exceed the standard deviations of all k future samples 
each of size m is 

S o D u (K,m-l,n o -l;r) Js (3-8) 

Similarly, a lower 100 (l-r)$ simultaneous prediction limit 
to be exceeded by the standard deviations of each k future 
samples of size m is 
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S 0 D L (K,m-l ) n 0 -l,l-r) Jl (3-9) 

This result is also the same as the one Hahn [5] obtained 
for independent samples. 

C. NUMERICAL EXAMPLES 

Suppose a gun is selected at random and fired n o = 6 
times and the resulting miss distances from a target are 
measured. Let S Q = 1.00 be the standard deviation of these 
observations. A prediction interval for the standard devia- 
tion S 1 of n^ = 10 future attempts is desired. 

Then, a two-sided 95% prediction interval to contain 
the standard deviation S^ for a single future sample of 10 
observations is obtained as follows; 

For n = 10, n = 6 and r = 0.95, E(9,5;-975) = 6.68 and 

1 O ^ j 

F(5,9;0.975) = 4.48 and S o F(9, 5 ;0. 975)"^ = (1. 00) (6. 68) 2 = 2.584 
S o F(5,9;0. 975 ) _Js = (1.00)(4.48)-* = 0.472. 

Substituting the above in equation (3-5) the required 
prediction interval for is (0.472,2.584). Next, an upper 
95# simultaneous limit to exceed the standard deviation of 

all 3 future samples of size 10 is; 

For m = 10, n Q = 6 , K = 3 and r = 0.95, D y (3, 9 ,5 ; 0 . 95 ) * 6.41 

and D u (3,9,5;0.95) 5s S = (6. 4l) 1 *(1.00) = 2.762 (see table 28 
page 4l [1 ] ) . 
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IV. PREDICTION INTERVALS FOR THE ADDITIONAL 
OBSERVATIONS IN A FUTURE SAMPLE - 
CORRELATED CASE 

A prediction interval to contain a single future 
observation and a simultaneous interval to contain each 
of k additional observations of a random sample from a 

p 

normal distribution with mean y and variance a were obtained 
by Hahn [5]* In this chapter we extend these results to the 
case where the samples are correlated and the covariance 
matrix has the form defined in (3.3). In section A a 
prediction interval to contain a single additional observa- 
tion based on correlated observations is obtained, and 
section B deals with the construction of simultaneous 
prediction intervals to contain k additional correlated 
observations. Numerical examples are given in section C. 

A. A PREDICTION INTERVAL FOR A SINGLE FUTURE OBSERVATION 

Let X-^ ^2 >X^ , . . . ,X n be indegen^ant and have identical 
normal distribution with unknown mean y and unknown 
standard deviation a. It is required to construct a 
prediction interval for an additional observation X n+1 
based on the given n samples. 

Let 
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p 1 n — ? 

S n n Z (X 1 “* X n) 

n n l n 
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p *| n+1 2 

5 n+l = n+1 ± l 1 (X i " X n+1 ) 



Since 



X n+1 “ (n+1)X n+l ' nX n » - ><"■+> - ^ ^ 



n+1 



n 



the sum of squares (n+l)S^ + ^ can be partitioned as follows: 
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TT .,2. 
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c 2 j. n+1 n.,2 r n+1 n i 

' + n+1 > + { n+1 



= nS 2 + -5, - (X - X ) 2 ^ 
n n+1 n+1 n 



(4.1) 



Expressing the sum of squares in (4.1) as quadratic forms 
we get 



X'B X = X ' B, X + X ’ B„X 

— — o— — —1— — —2— 



(4.2) 



30 



where and are idempotent matrices and 



B = I - (n+1) 1 E and B = I - n 1 E. 

(n+Dx(n+l) (n+l7x(n+l) nxn nxn 



2 2 2 2 

Since (n+l)S n+ ^/a and nS n /a have Chi-square distribu- 
tions with n and n-1 degrees of freedom respectively and 
n(X n+1 - X R ) /n+1 is non-negative , Hogg and Craig’s theorem 
(theorem 5) applies to equation (4.1). Therefore, the two 

quadratic forms on the right hand side of (4.1) are mutually 

J — 2 2 

independent and n(X^ +1 - X n ) /(n+l)a has a Chi-square 

distribution with 1 degree of freedom. It also follows 

that the matrix B ^ in equation (4.2) is also idempotent. 

Now, suppose X is a vector random variable having 
(n+l)xl 

a multivariate normal distribution with mean y = (y ,y , y , . . . , y ) ' 

(n+l)xl 

and covariance matrix V which has the following 

(n+lTx(n+l) 

structure . 



V = |( H + H’) + a ( I - E ) (4.3) 

(n+l7x(n+l) (n+l7x(n+l) (n+l)x(n+l) (n+l)x(n+l) (n+l)x(n+l) 



where 



H 

(n+1 )x (n+1) 
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V 



( T ^ 
0* 



L.*t * ° «.A — Ulr ^ -T- ,7 ^ 0^.^ 

or ^ j>« (jl -j " ^ o 
v Ch s, ? ck. Jd 4 ; •<-.# cks*~ j., J -* 1 



f\ (JLQ v V^CxoVs ^ °* 



H* is the transpose of H, a and h i , i=l ,2 , 3 , . . . ,n+l are 
positive constant, I_ is an (n+l)x(n+l) identity matrix and 
E is an (n+l)x(n+l) matrix whose elements are all unity. 
Since the matrices Bq , B ^ , and B^ in equation (4.2) are 
idempotent and 

2 

(n+l)x?n+l) " — j - (n+ljx(n+l) ” ^ n+1 ^ (n+l)x(n+l) 

we may apply the Baldessari theorem (Theorem 6) to equation 
(4.1) to conclude that the two quadratic forms on the right 
hand side of the equations have centr al Chi-square distri- 
bution with n-1 and 1 de gree of f x£Bdom--reapectively and 
a re mutu ally independent. 

Thus, the random variable 



X'B 2 X J X'^X 
a / a 



>2 



^n+l )(X n+l X n^ 



a 



(n-wy 

a(n-l) 



has an F distribution with 1 and n-1 degree of freedom. 
We obtain a prediction interval for X n+1 as 



PrfF(l,n-l; (l-r)/2) < < F(l,n-1; (l+r)/2)} = r 
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or equivalently 



Pr{ V S n ( : 1 + ¥ ^ 1 ’ n_1 > ( 1_r/2 ) ^ < ^n+i < 

< V S n (1 + n )J5p(ljn_1;(1+r)/2)}5} = r 

where r is the chosen confidence coefficient and 
F( 1 ,n-l ; (1+r )/2 ) and F(l,n-1; (l+r)/2) are the appropriate 
percentage points of the F distribution with 1 and n-1 
degree of freedom. 

Now recall that F(1 ,n-l ; (1-r )/2) ** = t (n-1 ; (1-r )/2 ) 

and F(l,n-1;( 1+r )/2 ) ^ = t (n-1; (1+r) /2) 

This yields the following two-sided prediction interval 
to contain the additional observation X n+ ^ : 

C\+S n ( 1 + ^ (n-1 ; (1-r ) /2 ) ,^+S n (1 + \ (n-1 ; (1+r ) /2 ) ) (4.4) 



B. SIMULTANEOUS PREDICTION INTERVALS FOR k 
FUTURE OBSERVATIONS 

•Let ,X 2 ,X^ , • • • be the values of a given sample and 



X , n ,X , OJ X X , the values of k future observations. 

n+1’ n+2 5 n+3 n+k’ 

We assume that the sample observations X-^ ,X 2 ,X^ , . . . ,X n , 

X n+1 jX n+2 , . . . ,X are correlated and have a multivariate 
normal distribution with mean y = (y ,y , y , . . . ,y ) ' and 



covariance matrix V which has the form (4.3). 
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In order to construct a simultaneous prediction interval 
*' or X n+1 ,X n+2 ,X n+3 3 ’ ’ ’ >X n+k we first establish that 

n (X i" Y n )2 nS 2 

(i) I = has a Chi-square distribution 

i-1 a a 

with n-1 degree of freedom, 

(ii) the vector variable Z_ = (Z^,Z 2 ,Z 3 , . . . ,Z k ) 1 , where 

Z> = X ,, -X , has a multivariate normal distribu- 
i n+i n ’ 

tion and 

o 

(iij) the vector variable and nS are statistically 
independent . 

2 n - 2 

If nS = E (X. -X ) is expressed as a quadratic form 
i=l 1 n 

X’BX, where X = (X 1# X 2 ,X 3 , . . . ,X n ,X n+1 ,X n+2 , . . . ,X n+k ) * , 
then a necessary and sufficient condition for X'BX to have 
a Chi-square distribution is that BV is idempotent (see 
Theorem 2). 

To show that BV is idempotent, let the matrices H, H * , 
and E (*1.3) and the matrix V be partitioned as follows: 
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and 



V = 



/ 



/ 



' *1 
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nxk 
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kxk 


' £1 


—2 


nxn 
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-3 
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kxk 



/ 



I 
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I = 



\ 



h 


0 \ 


nxn 




0 


-H 1 




kxk j 



where + a(^-E^) 



and 



-2 " | ( ^2 + -3' ) + a( °-E 2 ) 



We know 



B 

(nxk)xTn+k) 



*1 


0 \ 


nxn 




- 




0 

l 


0 



where B, = - ( I 

— 1 CL — 

nxn nxn 



and BV 
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0 


0 


0 



1 E 
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nxn 




/ v n 
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—2 


, V 0 


V,. 


-3 


—4 



B. V, 


B. V„ \ 


-1-1 


-1-2 
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0 / 




(4.5) 
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In equation (4.5), and B^V 2 can be simplified as 

follows : 



B 1 V 1 = ^ ( -l" n 1 E 1 ){ j 5(H 1 +H 1 ') + 0(1^^)} 



■ 5<Wl 


+ 




1 + a Li ~ 






— E_ I. + — E_ E_ 
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+ ^ <«i 



1 T“»* 



- E) 
n — 



n 



where a = £ h, 

i=l 3 



and B 1 V 2 = ^(I^n 1 E 1 ) { %(H 2 +H 3 ' ) + a(0-E 2 )} 



■ + w 3 ' - a % - sSA - s£iH 3 ' + 



5<W 2 + ^—3 ' 



aE 2 



&2 - 5K» 3 ' + ^2> 



■ - n— 2^ 
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Thus 



BY = 



— 1—1 
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0 





A<«2-fl2> \ 
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0 / 



and 



(BY) (BY) 



liXi 
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hh 
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0 , 




o 
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(BiY,) 2 


(b 1 v 1 )(b 1 v 2 ) 
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( —1—1 ^ 2 - 
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and 



II - Si + ^ ( «l-fE 1 -±-E 1 H 1 + JL £l E 1 ) 



+ ^ct 1 ” n — 1 " n —1—1 + ^2 —1—1 ) 



+ ^HiHi-fBJa- ♦ 4Si> 



II - ffll + 



+ K<Si‘|El- ff Hi + 



+ n £l ) 

4a n 



I. - - E.. + - - E.. ) 

— 1 n —1 2a — 1 n —1 



e*)®*) - «Ii-f&i> + ^1-1^)) 



x(^(H 2 -|e 2 )J 



A + ^-1^)22 



- - ssr^i-FEiJSe 1 



= {H 0 - - E 0 + #- H 0 - 



E„ - - E, 



2 a —2 n — 2 2a —2 2 an — 2 n — 2 



+ §5- E„ } 
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2 -2 2 an -2 



2 an 



2 -2‘ 



^ ( »2 - 5 £ 2 > 
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Therefore 



( BV ) ( BV ) = 





2a ( -2 n-2 ) 1 


0 


0 

j 



BY 



Since BV is idempotent, X'BX/a has a chi-square distribution 
with n-1 degree of freedom. 

Next, Z_ = (Z^ , Zg > > . . . >Z k ) ' can be expressed as 



Z 

kxl 



C* X 

kx(n+k) (n+k)xl 



where C' = 



( — E , I ) 
n kxn kxk 



_Z has a multivariate normal distribution with mean C'jj = 0_ 
and covariance matrix C’VC as shown below: 



C'jj = ( — E , I ) y = y( n + 1) = 0 
kxn kxk (k+n)xl 



ill 



C'V 



-1 
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E 

kxn 



I ) 
kxk 
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/ (— + h .. )E 
/ n n+1 - 

(— — + h i ^ ) E 

v n n+2 — 



\ n 



(=# + h n+k )E 



lx(n+k) 



I { 



+h^)E , 
kxl 



^ n h 2 +h 2 ) - ; 

kxl 






n+k +h n+k ) — 
kxl 



+ a( 



^1 

n 



E 

kxn 



I ) - a( -fn + 1) E 
kxk • kxn+k 
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1 

2 



>E\ 



( if + Vi 

( TT + h n+2 



+ a( — E , I ) 
n — * — 
kxn kxk 



(4.7) 



( ir + Vk 



>* / 



lxn+k 



/< ir + 1 WM 



C'VC = 



< I 



( 1? + h n+2>* 



+ o(~ E , I ) V 

kxn kxk 



\ (— + h ,, )E / 

^ v n n+k — / 

lx(k+n ) 



/=i 



n 



E 

nxk 



» 

I 

kxk 



+ (-^r+h , , ) ]E 



n n+1 n 



n n+1 — 



CC^+h ^)(=±) + (•— +h^ , „ ) ]E 



■ -n 



-a , 



n n+2 n 



n n+2 — 






n n+k' 



lxk 



+ a(— =«- E E + I ) 
n kxk kxk kxk 



^3 



a( — E + I ) 
n kxk kxk 



= a 
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n 



1 

n 



1 

n 

1+1 

n 



1 

n 



1 

n 

1 

n 



1 

n 

1 

n 

• 

1 

n 



1 1+1 

n n 



Therefore, Z ~>N(0_, a ( pj-E + I_ ) ) 

Also, Z = C_'X and X'I3X are statistically independent 
(see Theorem 4) 



^!T +h n+l^ - 



C ' VB = 



I? 



<>„«> £ 



+ «< if £ • 

kxn 



\(= a < 



(— +h .. ) E 
n n+k — 



lx (k+n ) 



I - 
nxn 


n x E 
nxn 


0 


0 


0 



since 



I ) 

kxk 
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Thus , 
with mean 
of S 2 . 




lxn 



= 0 

each Z ± , i=l , 2 , 3 3 • • • > k j is normally distributed 
0 and variance a(l + 1/n) and is independent 
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Let Z^ ' , 1=1, 2, 3, . . • ,k, be the standardized variables 
defined by 



7 1 = 



Z ± - 0 



{a(l + 



X n+1 - X 
<a(l + ±)) h 



the variables 



T i 



V 



(n-l)S* 
a(n-l ) 



X , , - X 
n+i 



+ 



S(1 + i)‘ 



i=l,2,3, . . . ,k 



are jointly distributed according to the multivariate 
generalization of the Student t-distribution with n-1 degree 
of freedom and correlation matrix E defined by 
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To find a two-sided lOOr# simultaneous prediction 
interval to contain each of k additional observations, let 



U be such that 



U U U 



r 




dt-jdt2 . . . dt^ 



(4.8) 



Then 



Pr{-U < < U and 



. . . , and -U < < U} 

sa+^-r 2 



r 




The resulting 100r% simultaneous prediction interval to 

contain the values X ,X ,„,X X , , of all k future 

n+± 5 n+2 J n+3 n+k 

observations is 



For selected values of r, the values of U to satisfy the 
equation (4.8) were tabulated by Hahn and are available 
in [4]. 

C. NUMERICAL EXAMPLES 

Based upon a random sample of observations from a normal 
distribution whose mean and standard deviation are unknown, 
the following data is obtained. 



x ± u(i + -) 1 " 2 S 
n 



(4.9) 



51.4, 49.5, 48.7, 49.3 and 51.6 
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Prom the data, the sample mean X and sample standard 
deviation S are calculated as 

X = (51.4 + 49.5 + 48.7 + 49.3 + 51.6 )/5 = 50. 10 

and S 2 = { (51. 4-50. I) 2 + (49. 5-50. I) 2 + (48. 7-50. I) 2 
+ (49. 3-50. I) 2 + (51. 6-50 . l) 2 }/5 
= 6.9/5 = 1.38 

S = 1.175 

Then, a two-sided prediction interval to contain a single 
future observation X n+1 with 95$ probability is (see 
equation (4.4): 

For n=5, r=0.95, from the Student’s t-tables 
t(4, 0.975) = 2.776. Substituting the observed values 
in (4.4) a 95$ prediction interval for X n+ ^ a future 
observation is given by 
(46.527, 53.673) 

Next, a two-sided 95$ simultaneous prediction interval to 
contain each of 10 future observations is obtained using 
equation (4.9) • 

For k=10, n=5 and r=0.95 from the tables in [4] 

U(l+^-) Js = 5.23. Thus X ± Ud + ^S = 50.1 ± 5.23(1.175) 
and the required prediction interval is given by 
(43.855, 56.145) 
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